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Abstract 

We present new bounds on the locality of several classical symmetry breaking tasks in dis- 
tributed networks. A sampling of the results include 

1. A randomized algorithm for computing a maximal matching (MM) in 0(log A+(log log n)^) 

rounds, where A is the maximum degree. This improves a 25-year old randomized algo- 
rithm of Israeli and Itai that takes O(logn) rounds and is provably optimal for all log A in 
the range [(log log n)*, \/log n] . 

2. A randomized maximal independent set (MIS) algorithm requiring 0(log A-^/log n) rounds, 
for all A, and only 2'^(^'°8^ log") rounds when A = poly(logn). These improve on the 
25-year old 0(logn)-round randomized MIS algorithms of Luby and Alon et al. when 
log A <C \/log n. 

3. A randomized (A + l)-coloring algorithm requiring O (log A + 2*^ ) rounds, improv- 
ing on an algorithm of Schneider and Wattenhofer that takes 0(log A + y/\ogn) rounds. 
This result implies that an 0(A)-coloring can be computed in 2*^(^^°s rounds for all 
A, improving on Kothapalli et al.'s 0(\/Iog^) -round algorithm. 

We also introduce a new technique for reducing symmetry breaking problems on low arboric- 
ity graphs to low degree graphs. Corollaries of this reduction include MM and MIS algorithms 
for low arboricity graphs (e.g., planar graphs and graphs that exclude any fixed minor) requiring 
O(vTogn) and O(log^^^n) rounds w.h.p., respectively. 
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1 Introduction 



Breaking symmetry is one of the central themes in the theory of distributed computation. At 
initiahzation the nodes of a distributed system are assumed to be in the same state (but with 
distinct node IDs), yet to perform any computation the nodes frequently must take different roles, 
that is, they must somehow break their initial symmetry. In this paper we study three of the 
classical symmetry breaking tasks in Linial's COCAC model |2Uj : computing maximal matchings 
(MM), maximal independent sets (MIS), and (A + l)-coloring, where A is the maximum degreej^ In 
the COCAC model each node of the input graph G hosts a processor, which is aware of its neighbors 
and an upper bound on the size of the graph. The computation proceeds in synchronized rounds 
in which each processor sends one unbounded message along each edge, which may be different for 
each edge. Time is measured by the number of rounds; local computation is free. At the end of 
the computation each node must report whether it is in the MIS, or which incident edge is part of 
the MM, or its assigned color. See |26l Ch. 1-2] for an extensive discussion of distributed models. 



Prior Work. The vertex coloring, MM, and MIS problems have been the subject of intensive 
research since the mid-1980s [Il[2l|3l[9l[ini[IIl[l9l[20l[22l[2ll[28l[3l]. In 1986 Israeli and Itai [II] 
devised a randomized algorithm that computes a MM in O(logn) time with high probabilityj^ 
and the same year Luby [22j and Alon, Babai, and Itai [I] independently proposed 0(logn)-time 
randomized MIS algorithms, which can also be used to compute (A + l)-colorings in O(logn) 
time. These are the fastest known algorithms for MM and MIS on general graphs. It was recently 
shown that (A + l)-coloring can be computed faster [31j, in 0(log A + \/log n) time w.h.p. Kuhn, 
Moscibroda, and Wattenhofer |18j proved that there exist n-vertex graphs with maximum degree 
2©(v^Iogn) which any algorithm for MM or MIS (even randomized) requires Vt{y/\ogn) time. 
This implies a lower bound of $7(min{log A, yTogn}) for these and many other problems. We will 
henceforth refer to this result as the KMW hound. 

For deterministic algorithms the situation looks quite different. The fastest MIS and (A + 1)- 
coloring algorithms on general graphs run in 0(A + log* n) time [S US] and 20(VR?^) time [21 
128] . whereas the fastest MM algorithms on general graphs run in 0(A + log* n) time pT] and 
O(log^n) time [10]. For certain graph classes the bounds cited above can be improved. Barenboim 
and Elkin [3j showed that on graphs of arboricity A, MM and MIS can be computed in time 
0(logn/ loglogn) for A sufficiently small (A must be less than log^~'^n for MM and less than 
log^/^~^n for MIS.) We believe arboricity is an important graph parameter as it robustly captures 
the notion of sparsity without imposing any strict constraints. A graph has arboricity A if its 
edge set can be covered by A forests, or equivalently [25], if every subgraph has density less than 
A|^ Lenzen and Wattenhofer J9] gave a randomized MIS algorithm for unoriented trees (A = 1) 
running in 0{y/log n ■ loglogn) timej^ The MM and MIS problems on graphs of bounded growth 
have also been studied recently [9], culminating in an algorithm [31] running in 0(log* n) time on 

^The MM problem is to compute a maximal set of vertex-disjoint edges. The MIS problem is to compute a 
maximal set of vertices, no two of which are adjacent. The (A + l)-coloring problem is to assign colors from the 
palette {1, . . . , A + 1} such that no edge is monochromatic. 

^ With high probability (w.h.p.) means with probability 1 — for an arbitrarily large fixed constant c. All 

randomized algorithms cited in the paper finish their computation in the stated time bound, w.h.p. 

^Note that many sparse graph classes have A — 0(1), such as planar graphs, graphs avoiding a fixed minor, 
bounded genus graphs, and graphs of bounded degree or tree/clique/pathwidth. 

■^The claimed time was 0{\/log n log log n) but there was a flaw in the analysis. See Section[7| 
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Figure 1: A summary of upper and lower bounds for MM, MIS, and vertex coloring. Here A is 
the maximum degree and A the arboricity. All running times are randomized (w.h.p.) unless noted 
otherwise. 
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this graph class. The log* n term in the running times cited above cannot be improved, by Linial's 
lower bound [20j. 

Faster coloring algorithms are known if one allows more than (A + 1) colors. Linial j20j devised 
a deterministic 0(A^)-coloring algorithm requiring log* n + 0(1) time, which was improved to 
^ log* n + 0(1) by Szegedy and Vishwanathan [_30j • Kothapalli et al. [15] gave a randomized 0(A)- 
coloring algorithm running in 0(log A + \/log n) time, for all A, and Schneider and Wattenhofer 
[31] devised a randomized 0(A + log^"''^'^'^ n)-coloring algorithm running in time 0{k + log*n). 
Barenboim and Elkin [5] showed that A^+'^-coloring can be computed in 0(logA • logn) time 
deterministically, for any e > 0. Graphs of bounded arboricity A were shown [21 [5] to be amenable 
to faster coloring algorithms. In particular, A^^'^-coloring can be computed deterministically in 
0(logA • logn) time and 0(A)-coloring in 0(A'^ • logn) time [5], for any fixed e > 0. 

Our Results. We give a new randomized MM algorithm running in 0(log A + log^logn) time, 
improving the 25-year old bound of O(logn) [11] and 0(A + log* n) [27]. According to the KMW 
lower bound our algorithm is provably optimal whenever log A S [log^logn, y/\ogn\. 

We give a randomized MIS algorithm running in 0(log A • ylogn) time, improving the 25-year 
old 0(logn)-time algorithms of Luby |22j and Alon, Babai, and Itai [1] when log A ^ ^/Xogn. If 
A = (logn)'-^(^) we provide an even faster algorithm running in 

20(Vloglogn) -pi^ggg 

are the 

first general MIS algorithms running in sublogarithmic time for such a wide range of A. 

For vertex coloring we give a (A + l)-coloring algorithm running in 0(l0gA + 2^(Vloglogn)) 
time, improving an 0(logA + ^log n)-time algorithm of Schneider and Wattenhofer [32j . As a 
result of this, we can now compute 0(A)-colorings in 2*-^^^'°^ log") time for all A, improving the 
0(-v/logn)-time algorithm of Kothapalli et al. |15j . 

As noted above, A is a significantly more sensitive graph parameter than the arboricity A. We 
give a new technique for reducing the symmetry breaking problems on low arboricity graphs to 
low degree graphs, which is of independent interest. As direct corollaries, our reduction shows that 
MM and MIS can be solved in 0(log A -|- yTogn) time and 0(log A • yJXogn + log^^^ n) time, resp., 
on graphs with arboricity A. In particular, for planar graphs (and, more generally, for graphs that 
exclude any fixed minor), our MM and MIS algorithms require only 0(\/logn) and 0(log^'''^n) 
time, respectively. 

When A is very small we give several algorithms that are faster for certain ranges of A. For 
example, when A = 0(1) our MM, (A -|- l)-coloring, and MIS algorithms run in time, respectively, 
0(logA + 

logkigiogn)' 0(logA + log logn), and 0(log A + log logn). These time bounds are 
exponentially faster, as a function of n, over previous deterministic algorithms [3l |5]. For the 
special case of trees (A = 1) we give an even faster MIS algorithm whose running time is the 
minimum of 0{^\og n log log n) and O (log A log log A -|- log'^gkign )' '^^lich improves on [19] . 
See Figure [T] for a comparison of our results with prior work. 

Technical Summary. All of our algorithms take the following two-phase approach. In Phase I 
we use some iterated randomized procedure that, with high probability, finds a large partial solution 
(a matching, independent set, or partial coloring) that effectively breaks the global problem into a 
collection of disjoint subproblems with poly(logn) size or 0{yJ\ogn) diameter. In Phase II we solve 
each subproblem using the best available deterministic algorithm. It is for this reason that our 
running times are usually exponentially faster in terms of n than the best deterministic algorithms, 
e.g., a 2'^(v^) bound becomes 2'^(^i°gi°g"), a , '"p" bound becomes , '°g'°f " and so on. This 

° ' ' log log n log log log n 
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strategy has been used in other contexts, for example, in Beck's [6] algorithmic approach to the 
Lovasz Local Lemma, the local hypergraph coloring algorithms of Rubinfeld et al. [29], and the 
0(A)-coloring algorithm of Kothapalli et al. |15j. The main technical difficulty is in the analysis 
of Phase I's iterated randomized procedure. 

Our analyses often bound the running time in terms of A, which can be significantly larger than 
the arboricity A. We give a new reduction that, roughly speaking, reduces the maximum degree 
to A • 2^°^' " in 0(log^~'^ n) time, for any e G (0, 1). This allows us to achieve sublogarithmic (in n) 
running times using algorithms that depend logarithmically on A. 

Organization. Section [2] introduces some terminology and notation. Our MM, MIS, and (A + 1)- 
coloring algorithms are presented in Sections 3}|5j Section [6] presents a reduction from graphs with 
small arboricity to small degree. In Section [7 we give an improved MIS algorithm for trees (A = 1) 
and girth > 6 graphs, and correct a small flaw in the analysis of [S^. Appendix |A] restates some 
standard concentration inequalities. 



2 Preliminaries 

All logarithms are base 2 unless noted otherwise. The input graph is G = {V,E). For any V' C V, 
let G{V') be the subgraph of G induced by V . Let Tciv) = {u \ {v, u) e E} and degQ{v) = \TGiv)\ 
be the neighborhood of w in G and its cardinality. Let Tg{v) = Tg{v) U {v} be the neighborhood 
including v. Let A = A(G) = max^gy deggr(w) be the maximum degree. Let distG{u,v) be 
the length of the shortest path (i.e., distance) between u and v in G. The diameter of G is 
maxu^vev distG(n, v) and the weak diameter of a subgraph G{V') is the maximum distance between 
l^'-vertices with respect to G, that is, max{distG(^i, v) \ u,v G V'}- In a directed graph the indegree 
(outdegree) of v is the number of edges directed to (from) v, and the degree of v is the sum of its in 
and outdegree. A forest is an acyclic graph. An oriented forest is a directed forest in which each non- 
root has outdegree 1; a pseudoforest is a directed graph in which all vertices have outdegree or 1. 
The arboricity A = A(G) is the minimum number of edge-disjoint forests that cover E. By the Nash- 

Wihiams theorem [25], A(G) is equivalently expressed as max | r |v-^G^v-^)|^ii l ■ ^' ^ V^, |y | > 2|, 
i.e., roughly the edge-density of any subgraph G{V') with at least 2 vertices. Many other measures 
of graph sparsity are, for our purposes, equivalent to A, such as degeneracy or k-core number. 

In our analyses we use several standard concentration inequalities due to Chernoff, Janson, and 
Azuma-Hoeffding. See Appendix ^ or [7] for statements of these theorems. 



3 An Algorithm for Maximal Matching 

The Match procedure given below is a generalized version of the Israeli- Itai MM algorithm [llj. 
(See also j33j.) It is given two vertex sets Ui, U2 (not necessarily disjoint) and a matching M, and 
returns a matching on Ui x U2 vertex-disjoint from M. 

Match(?7i,[/2,M) 

1. Initialize directed graphs Fi = (Ui U [/2, 0) and F2 = {Ui U U2, 0). 

2. Each V E Ui\V{M) chooses a neighbor u G U2\V{M) uniformly at random and includes 
{v,u) in E{Fi). (Note: Fi is a pseudoforest.) 
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3. Each u G C/2 with indeg^^(n) > chooses the v' £ {v : {v, u) £ E{Fi)} with maximum node 
ID and includes {v',u) in E{F2). (Note: F2 consists of directed paths and cycles.) 

4. If d.egp^{v) = 2 then v chooses a bit b{v) € {0, 1} uniformly at random. Otherwise h(v) = 
(respectively, 1) if v is at the beginning (resp., end) of a path in F2. 

5. Return the matching {{v,u) G E(F2) : b{v) = and b{u) = 1}. 

Note that Ui and C/2 are allowed to contain matched vertices since these are specifically excluded 
in step 2. Phase I of our maximal matching algorithm consists of a sequences of G(logA) stages. 
In the pseudocode below Mi is the matching M just before stage i, Vi = V\V{Mi) is the set of 
unmatched vertices before stage i, and degj and Fj are the degree and neighborhood functions 
w.r.t. G{Vi). Define the parameters 5i,Ti, and as 

_ AVcilnn _ 2A _ _ ^i'^i 

pVcilnn /9 ' 2 

where ci is a sufficiently large constant and p 1.06 a constant to be determined precisely later. 
Define Vl° = {v e Vi : degj('i;) < Tj} and = {v e Vi : degj('u) > Si} to be the low and high 
degree vertices at the beginning of stage i. In each stage i we supplement the current matching Mj 
first with a matching on V^'° x V^^^, then with a matching on Vi. 

Phase I: Initialize Mq and execute stages 0, . . . , C2 log A — 1. 
Stage i: 

1. Mj+i ^MiU Match(y/°, V^') 

2. Mi+i Mi+i U Match(Fi,yj) 

Phase II: Let C be the connected components induced by V^jiogA with size at most log^n. 
Deterministically compute a maximal matching M{C) on each C G C and return McjiogA U 

The algorithm always returns a matching. If, at the beginning of Phase II, C contains all 
connected components on V^jiogA then the returned matching is clearly maximal. Thus, our goal 
is to show that with high probability, after Phase I there is no connected component of unmatched 
vertices with size at most log^ n. In the proof below deg(S') is short for ^^eS deg(^t) for S dV. 

Lemma 3.1 Let I he any index for which ti > cslnn for a sufficiently large constant C3. Then for 
all i G [0,1], degi_^_i{v) < 6i and deg\^^{v) '= degi_^_i{Ti^i{v)) < Vi with probability 1 — l/poly(ra). 

Proof: The two calls to Match in stage i are intended to maintain the two claimed properties: 

that f 's degree degrades geometrically in each round and that the sum of w's neighbors' degrees 

degrades geometrically. The proof is by induction on i; the base case is trivial. For the sake of 

minimizing notation we use degj,Fj, etc. to refer to the degree and neighborhood functions just 

before each call to Match in stage i. Consider a vertex v £ Vi at the beginning of stage i. By the 

(2) 

inductive hypothesis degj(f) < 6i-i and deg) (v) < Vi^i, from which it follows that v can have at 
most i^i-i/Ti = ^^^^ = 5i ■ (p72) neighbors not in If degi{v) > Si (i.e., v G y>^) then in the 
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first call to Match, v will be matched with probabilit}^ 1 - (1 - l/Ti)(i-^'/2)<5« > i _ ^{i-pV2)ci inn/2_ 
By a union bound all vertices in V^^ are matched with probability at least 1 — l/n'^'^^^~^ /2)/2-i _ 
l/poly(n). 

We now argue that after the second call to Match, degl_^^^{v) < Vi. Call a node chosen in the 
Match procedure if it has positive indegree in Fi. A node v will be guaranteed to have positive 
degree in F2 if it is chosen or if it chooses an edge {v, u) and u has indegree 1 in i^i, i.e., u has no 
choice but to put {v, u) in F2. Once in a path or cycle in F2 the probability that v is matched is at 
least 1/2. 

We evaluate the edges chosen by V^-vertices for Fi sequentially, beginning with all vertices 
outside of Ti{v), then to each vertex in Ti(v) one at a time, in descending order of node ID. (Recall 
that these were used for tie- breaking in Match.) Let u £ ^iiv) be the current neighbor under 
consideration. If at least degj(u)/2 neighbors of u are currently unchosen (by vertices already 
evaluated) then place u in set A, otherwise place u in set B. If u was put in set A and u does 
choose a previously unchosen neighbor (implying that it has positive degree in F2) then also place 
u in set A' . 

(2) (2) 
We first analyze the case that degi(A) > deg\^'{v)/2 > Ui/2. (If degr'(^;) < Ui there is nothing 

to prove.) Observe that each vertex u, once in A, is moved to A' with probability at least 1/2, and 

if so, contributes degj(«) < 6i to degj(j4')j^ The probability that after evaluating each u G ^i{v), 

degj(^') is less than half its expectation is: 

Pr[deg,(A') < iE[de&(A')]] < Pr[deg,(^') < ldeg,{A)] 



-^^P(~ 2EL(deg,W)^ ) {Corollary [A]!]} 

^^^P(-M(4?OTW) {deg.(n)<5a 
<exp(-^^) 

<exp(-^Ti) {degi{A)>Ui/2 = 6iTi/'i} 

^ ^^^^~64pWci Inn) 



^Note that since Vi° (1 Vj^^ — 9, Fi consists of stars and F2 consists of non-adjacent edges, all of which are added 
to the matching. 



Note that this process fits in the martingale framework of Corollary A. 4 Ifere Xj is the state of the system after 
evaluating the jih neighbor ii of i; and Zj is deg;(it) if u joins A' and otherwise, which is a function of Xj. Thus, 
each Zj has a range of at most 5i. 
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We proceed under the assumption that degj(^') > jdegj(^) > fi/8. Since each vertex with positive 
degree in F2 is matched with probabihty at least 1/2, E[degj^;^(^')] < ^degi{A'). Moreover, 
whether v £ A' is matched depends only on the bits selected by its neighbors in F2, that is, the 
dependency graph of these events has chromatic number x = 3. Thus, 



Pr[deg,+i(AO > I desM')] < «^p(-^SSS)F) {TheoremEJ} 



/^^2 



<exp(-^^) 

< exp(-^^) {deg,(A') > Ui/8 = 6^/16} 

< exp(- 



24 16 > 
A 



The second case to consider is if degj(S) > deg.^''(t')/2 > i/j/2. Note that E[degi^i{B)] < | degj(i?) 
since at least half of i?-vertices' neighbors are matched with probability at least 1/2. By another 



application of Theorem A. 2 we have 



Pr[deg,+i(i?) > I deg,(i?)] < exp(-^^^^g|)^) {Theorem[Aj} 

\2 



^^M-m idet!{Byi)Sf ) {x = 3,degi{u) <6i} 

< exp(-^f ) {deg,(i?) > 1/./2 = din/A} 



exp( 



A 



['2') 7 ('2') 

By a union bound, deg\_^:^{v) < | deg^^ (v) for all v £ V with probability 1 — nexp(— 0(Tj)). We 
set p = \/8 /7 ^ 1.06, concluding the induction. □ 
implies that after / = 0(ln(A/ In^^^ n)) stages the maximum degree is at most 



Lemma 



3.1 



61 = (ci lnn/2)T; = 0{ln^ n). Lemma 3.2 implies that by the end of Phase I all surviving connected 
components have size poly(logn). 

Lemma 3.2 At any point in Phase I, if the maximum degree in the graph induced by unmatched 
vertices is A, then for some constant C4, after C4log A stages all connected components of unmatched 
vertices have size A^logn, with probability 1 — l/poly(n). 



Proof: The observations made in Lemma 3.1 imply that in each call to Match(Vi, Vi), each u loses 
a constant fraction of its neighbors (either because they are matched or u itself is matched) with 
constant probability. Moreover, the event that this occurs (a success for u) is independent of the 
success or failure of any u' at distance at least 5 from u. 

We use the approach of [6l [29] to show that no components with size > log n survive C4 log A 
stages. Consider a subgraph H of G with s vertices. One can easily see that there is some 
Vo{H) C V{H) with \Vo{H)\ > s/A^ = t such that for all u,u' G Vo{H), dist{u,u') > 5 and 
dist{u,Vo{H)\{u}) = Sj^ Such a set Vq{H) corresponds to a tree with size t in the graph = 
{V, {{u,u') I dist(n, u') = 5}), which has maximum degree less than A^. There are fewer than 



^For example, repeatedly select a vertex u in V{H) at distance 5 from some previously selected vertex, then 
remove all vertices within distance 4 of u. 
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4* distinct trees on t vertices and fewer than n ■ A^* ways to embed a tree on t vertices in G^. 
For any one vertex the probabihty that it is not ehminated is at most the probabihty that it is 
not successful 0(log A) times after C4logA stages, which can be made A"'^^ for any C5 by making 
C4 sufficiently large. Since Vb(-ff)-vertices are at distance at least 5 from each other, these events 
are independent and the probability that H survives C4logA stages is at most A"'^''*. By a union 
bound, the probability that any such H survives is at most n • 4* • A^*~'^^* = l/poly(n) for t = log n. 

□ 

Theorem 3.3 A maximal matching can be computed in 0(logA + log^logn) time w.h.p. in an 
arbitrary distributed network. 



Proof: Lemmas 3.1 and |3.2| imply that after the C2logA stages of Phase I, for a sufficiently large 
constant C2, all connected components of unmatched edges have size s = 0(log^ n) with probability 
1 — l/poly(n). Phase II is implemented with the deterministic maximal matching algorithm of [lOj. 
which runs in O(log^s) = O(log^logn) time. □ 



4 Maximal Independent Set Algorithms 

To compute an MIS efficiently we employ the same general strategy used in our MM algorithm. 
We run a randomized algorithm (a variant of Luby's in this case) for a certain amount of time then 
argue that the connected components in the graph induced by vertices with degree at least A/2 
have weak diameter 0(-^log n) (variant 1), or if A = poly(logn), have size 0(poly(logn)) (variant 
2). In the first case we use the trivial 0(weak diameter) = 0{\/log n) MIS algorithm and in the 
second we use the Panconesi-Srinivasan [28j algorithm, which runs in 

20(Viogiogn) ^ijj^g^ Applying 

this halving algorithm log A times reduces the maximum degree to zero. Since O(logAvTogn) is 
not an improvement over the logarithmic time MIS algorithms for A > 2^'°s", we assume in this 
section that A < 2^^. 

Phase I of Halve computes independent sets /q = ^ A ^ ■ ■ ■ ^ -^k and Phase II computes an 
MIS on the components of high-degree (> A/2) vertices in y\r(lK). In stage i the active vertices 
are Ai = l/\r(/j) and degj and are the functions with respect to Ai. 

Halve — Phase I: 

Initialize Iq and execute stages 0, 1, . . . , k — 1 where k = c^yjlogn (variant 1) or cg log A 
(variant 2). 

Stage i: 

1. Each V ^ Ai selects itself with probability 

2. Jj+i ^ liU {v \v is the only vertex in Ti{v) that selects itself}. 
Halve — Phase II: 

Let U = {v £ Afi : deg^(t>) > A/2} be the set of active vertices with degree at least A/2. 
Let C be the set of connected components of G{U) with weak diameter less than 5-v/logn 
(variant 1) or size less than A^logn (variant 2). Deterministically compute an MIS /(C) for 
each C G C and return U UceC ^i^)- 
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Lemma 4.1 Let S Ai be such that dist(n, u') > 5 and degj(u) > A/2 for all u,u' G S. The 
probability that S C Ai^i is less than p''^' where p = 1 — (1 — e~^/^)e~^ ~ 0.85. 

Proof: Whether a vertex becomes mactive in stage i depends only on the choices of vertices within 
distance 2. The distance-5 constraint ensures that the events {u E Ai^i}u£S are independent, so 
it suffices to show that Pr[M G ^j+i] < P- Let {u = vo,vi,V2, ■ ■ ■ , Vd} = Ti{u) be the set containing 
u = vq £ S and its active neighbors vi, . . . ,V(i and let j be the first index such that Vj selects 

itself in stage i. The probability that j exists is 1 — (1 — ^^)°''''^ > 1 — e~^+^ > 1 — e~^/^, and 
if it exists it will join Ij+i if none of its neighbors select themselves. By definition vq, . . . 




Figure 2: The vertex vj is the first among {u = vq, . . . ,?;^} to select itself. The probability that 
all neighbors of vj will not select themselves is: (1 — 1/(A + l))ri(fj)\{fo,---,fj-i}^ 

have not selected themselves, so the probability that vj joins the MIS (and makes u inactive) is 

(1- aTt)'^'^^'^^^'"'""''''"'^' ^ (1- aTt)^ > T^"^' ^Au^Ai+i] > (l-e-i/2)e-i. See Figure|2| 
for an illustration. □ 

In order to prove that U induces components with weak diameter less than 5\/Togn it suffices to 
prove that for each u,u' £ V at distance at least 5(-v/logn— 1), every path from u to u' contains some 
vertex not in U. To that end we define V to be the set of all paths (not necessarily shortest) between 
pairs of vertices at distance at least 5(\/logn — 1). We first claim that each P = {ui, . . . , Ur) G V 
contains a Q{P) = {gi, . . . , g^/i^^} C V{P) such that dist(u, u') > 5 for all n, n' G Q{P)- We 
generate Q{P) one vertex at a time maintaining the invariant that dist{qj,Ur) > 5(-v/Iogn — j). 
Define qi = ui and once qj is known, define qj+i = Uj. where k is the maximum index such that 
dist(gj,tifc) = 5. It follows that for all / < j, dist(gj', gj+i) > 5. By the triangle inequality, 
disi{qj,Ur) > b{-\/\ogn — j) implies dist(gj+i, Ur) > b[-\/\ogn — (j + 1)). 

Define Q = {Q{P) \ P £V} and W to be the set of all walks of length exactly 5(\/logn — 1). 
Every Q{P) G Q can be mapped injectively to a walk in W by taking the concatenation of arbitrary 
shortest paths between successive vertices in Q{P). Thus, |Q| < |W| < nA^*^^'°s 

Lemma 4.2 After k = ce \/log n (or k = celogAj stages of Phase I, U induces components with 
weak diameter less than Sylogn (or size less than A^lognj w.h.p., for a sufficiently large cq. 

Proof: If the weak diameter criterion is violated then there is some P G V with V{P) Q JJ . By 



Lemma 4.1 , in each stage i that all (5(P)-vertices have degree at least A/2 the probability that none 
become inactive in stage i + 1 is exp(— r2(|(5(P)|)) = exp(— r2(\/logn)). Thus, after At iterations the 
probability that Q{P) C [/ is e = exp(— ri(-v/Iogn) • k) = exp( — r?(c6 l ogra)). By a union bound, 
the probability that any Q e V has Q(P) ^ U is e\Q\ < enA^^Viogn-i) ^ ^^6 (-3^^^,^ ^ < 2^1^)^ 
which is l/poly(n) for sufficiently large cg. It follows that no P £ V has V{P) ^ U with probability 
1 — l/poly(n) and that Phase II successfully makes inactive all vertices in U. 
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The analysis of the second variant of the algorithm follows that of Lemma 3.2 Each sub- 
graph with s vertices contains a vertex set with size t > s/A^ that forms a tree in = 
{V,{{u,u') I dist(u,n') = 5}) with size t. There are at most n4*A^(*~^) trees embedded in 
with size t. If each of the t vertices has degree A/2 in stage i, by Lemma 4.1 the probabil- 
ity that all t are active in stage i + 1 is exp(— r2(t)) and the probability that all are in U is 
e = exp(— O(tK)) = exp(— r2(c6t log A)). By a union bound the probability that a component with 
size s = tA'^ exists in U is less than en4*A^(*~^\ which is l/poly(n) for t = logn and sufficiently 
large cq. □ 



Theorem 4.3 An MIS can be computed in 0(log A-v/log n) time w.h.p. in an arbitrary distributed 
network, or in exp(0(-^log log n)) time w.h.p. when A = poly(logn). 



Proof: The first bound follows from Lemma 4.2 and the fact that an MIS /(C) for C S C can be 



computed in 0{\/log n) time in Phase II as follows. The vertex v £ G with minimum ID ascertains 
G{G) in 0{yJ\ogn) time, selects an arbitrary MIS, and broadcasts its choice to all vertices in C, 
also in 0{y/\ogn) time. The second bound follows from Lemma 4.2 by applying the Panconesi- 

exp(0(-v/log log n)) time 
□ 



Srinivasan algorithm in Phase II, which runs in exp(0( Y^log(A^ logn)) 
for A = poly (logn). 



5 A (A + l)-Coloring Algorithm 

Schneider and Wattenhofer |32j presented a randomized (A + l)-coloring algorithm running in 
0(log A + ylogn) time and several faster 0(A)-coloring algorithms assuming A = il(logn). Here 
we give a faster (A + l)-coloring algorithm running in 0(log A + exp {0{yJ\og\ogn))) time, which 
also implies that a graph can be 0(A)-colored in exp {0{\/\og log n) ) time, for any A|^ 

Theorem 5.1 A {IS. + 1)- coloring can be computed in 0(log A + exp(0(-v/log log n))) time w.h.p. in 
an arbitrary distributed network. 

We analyze the following natural randomized coloring algorithm p[3]. Let ^ = {1,...,A + 1} 
be the palette. Let q : F — )• ^' U {_L} be the partial coloring before the ith stage, where _L 
indicates no color, and let Ti{v) = {n G r(f) | Ci{u) =-L} be the uncolored neighborhood of v. In 
the ith stage each colored vertex retains its color and each uncolored vertex v selects a color c'{v) 

uniformly at random from its available palette '^i{v) =^ ^'\{c(u) | u G r(w)}. It sets Ci+i{v) = c'{v) 
iic'{v) {c'(n) I u G Fiiv)}. 

From v's point of view a color q £ ^i{v) is available if no neighbor of v selects q. We will call 
V happy if at least |^'j(w)|/8 > (degj(f) + l)/8 of its colors are available. The probability that a 
happy V becomes colored in stage i is therefore at least 1/8. 

Let ^~^(g) '= {u G ^i{v) \ q G ^i(n)} denote the neighbors of v vying for color q and let 

w{q) *== X]ue*-i((j) l/l^i(^)l be the weight on color g, i.e., each u distributes l/|^j(it)| units of 
weight to each color in its palette. Note that since degj(ti) > 1 for u G Ti{v), |^'i(u)| > 2 and 
therefore w{q) < |'I'~^(g)|/2. We can bound the probability that a particular color q is available as 
a function of w{q). 

^If A > log 71 use Schneider- Wattenhofer ^32^; if A < logn use our algorithm. 
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Pr[g is available to v] = JJ (^l - 



~ V''^ ~ I 'I' -1(g) I ) {min. when '^'^^{q) contribute equal wt.} 



> ar^'^ {w{q)/\^-\q)\ < 1/2} 



The expected number of colors available to v is therefore at least 

min y > min {"^iMl ■ "'(9')/dog,(i>) {minimized when all weights equal} 



> 1*^(^)1/4 {T.g'Hq')<deg,{v)} 



(deg,(t;) + l)/4 



If the number of available colors is at least half its expectation then v is happy. Since the 
indicator variables for the availability of colors are negatively correlated]^ Theorem A.l bounds 



the probability that v is not happy at exp(— 2(degj(?;)/8)^/degj(f )) = exp(— degj(u)/32). Let 
d* = 32c7lnn, for any desired constant cy, and let V* = {v £Vi\ degj(t') > d*}. It follows that all 
of V* is happy with probability 1 — l/n'^''~^; call this event H. 

Let deg|(f) = {u £ ^i{v) \ u G V*} be the number of neighbors of v with degree greater than 
d* . Conditioning on we evaluate whether each u E (J'iiv) H V*) is colored in stage i. The 
probability that u is colored depends on previously evaluated neighbors of v but is nonetheless 



always at least 1/8, due to H. By Corollary A. 4 the probability that deg*_,_;^(f ) > deg*(w) is at 



most exp(— (deg*(?;)/16) /(2deg*(w))) = exp(— deg|(f )/512). In other words, after r = log;^g/x5 A 
stages, no vertex has hl2d* uncolored neighbors each with d* uncolored neighbors, with probability 
1 — l/n^'^~^^^\ This does not imply any upper bound on the maximum degree, but we can reduce 
the remaining coloring problem to two problems with bounded maximum degree. Let C V^j^i 



be the vertices with degree less than d* . Following the proofs of Theorems 3.3 and 4.3 it is easy 
to show that after 0(logd*) = O (log log n) more stages of coloring, the connected components 
induced by uncolored vertices in l/< have size {d*)'^\ogn = poly(logn), and can therefore be 
colored deterministically in exp ( O ( \/log log n) ) time. The remaining uncolored graph necessarily 
has maximum degree 512(i*, and, similarly, will decompose into uncolored components of size 
(d*)^logn after 0{\ogd*) = O(loglogn) more stages of coloring. In total the time is 0(logA + 
exp ( O ( v^oglogn ) ) ) . 

6 Bounded Arboricity Graphs 

Recall that a graph has arboricity A if its edge set is the union of A forests. In the proofs of 



Lemma |6.l| and Theorem 6.2 deg^/(n) is the number of edges incident to u in £" C £J and 
degy/(u) = deg(j(y/u{u})('u) is the number of neighbors of n in y C V. 



^See Dubhashi and Ranjan [S] for an elementary proof that all balls-and-bins problems of this type give rise to 
negatively correlated variables. In this case the bins are colors and the balls are the color choices of ii's neighbors. 
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Lemma 6.1 Let G he a graph of m edges, n vertices, and arboricity A. 

1. m < An. 

2. The number of vertices with degree at least t>\ + l is at most \n/{t — X). 

3. The number of edges whose endpoints both have degree at least t > A + 1 is at most Xm/{t — X). 

Proof: Part [T] follows from the definition of arboricity. For Parts [2] and [sj let U = {v \ deg{v) > t} 
be the set of high-degree vertices. Since \E{U)\ < X\U\ (by Part [T]) and the sum of degrees of 
[7-vertices is at least t\U\, it follows that \E{U,V\U)\ > {t — 2X)\U\. Observe that the number of 
edges that cross between U and y \ C/ is at least ^^jg^(t — deg(/(tt)). Also, the number of edges 
\E{U) \ induced by C/ is ^ J2u&u '^^Sui'^)- follows that the overall number m of edges in the graph 
satisfies 

m>J2ii- deguiu)) + ^ ^ deg^(n) > t ■ \U\ - \E{U)\ > (t - A) • \U\. 

Hence Xn > m > {t — X) ■ \U\, and so \U\ < Xn/{t — A), proving part [2j Part [s] follows since the 
number of such edges is at most X\U\ < Xm/[t — A). □ 

Theorem 6.2 Let G he a graph of arboricity X and t > max{A^, (4(c + 1) Inn)'''} be a parameter. 
In O(log^n) time we can find a matching M C E(G) (or an independent set I C V{G)) such 
that with probability at least 1 — 1/n'^, the maximum degree in the induced graph G{V\V{M)) (or 
G{V\t{I))) is at most tX. 

Proof: In 0(logj n) rounds we commit edges to M (or vertices to /) and remove all incident edges 
(or incident vertices). Let G be the graph still under consideration before some round and let 
Ti = {v £ V \ degQ{v) > tX} be the remaining high-degree vertices. Our goal is to reduce the size 
of 7^ by a roughly f^/^ factor. Let J = {v € Ti \ deg-^(f) > tA/2}. It follows that any vertex 

V £ n' = n\J has degv\n{v) > *A/2. Let E C E(n',V\n) be any set of edges such that for 

V G Ti' , deg^{v) = tX/2 (that is, discard all but tA/2 edges arbitrarily) and let S = {u \ v £ 
%' and (u, n) € E} be the neighborhood of %' with respect to E. Note that |5| < tX\l-L'\/2. See 
Figure ^ We define bad 5-vertices, bad -E-edges, and bad 'H'-vertices as follows, where /3 = t^/''. 




Figure 3: Good 5- vertices have fewer than /? neighbors in %' and fewer than neighbors in S. 
Good ?^'-vertices have at least tA/4 good neighbors in S. 

Let Bs = {u £ S \ deg^(n) > /3 or degs{u) > /^^j, let = {{u,v) £ E \ u £ Bs}, and let 
B-H' = {v£n'\ deg^^^_{v) < At/4}. 
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By Lemma 6.1 3) the number of bad {u,v) G Bg. due to deg^(n) > /3 is at most \\E\/{P 



A) < X{tX\'H'\/2)/{f3 — A). By Lemma 6.1 2) the number of additional bad {u,v) G Bj^ due to 



degsiu) >f3^ is at most {/3 - 1)X\S\/ {^^ - X) < {(3 -l)X{tX\n'\/2)/{lf - X) since there are at most 
A|5|/(/?^ — A) such u and each contributes fewer than (5 edges in E. In total \B^\ < X^t\%'\/(5. 
Note that a bad v £ Ti' must be incident to more than tA/4 edges in B^ since deg^(t>) = tX/2. 
Hence < \B^\/{tX/A) < 4.X\'H'\//3 = AX\n'\/t^/'^ . 

Our goal now is to select some vertices for the MIS (or edges for the MM) that eliminate all good 
Ti' vertices, with high probability. Each vertex u G S\Bs selects a random number and joins the 
MIS if it holds a local maximum. The probability that u joins the MIS is l/(deg5\^^(u) + l) > 1//?^ 
and this event is clearly independent of all u' G S\Bs at distance (in S\Bs) at least 3. Note 
that since the maximum degree in the graph induced by S\Bs is less than the neighborhood 
of any good v G 7i'\B-}{' contains a subset of at least {tX/4)/f3'^ vertices, each pair of which is 
at distance at least 3 with respect to S\Bs- (Such a set could be selected greedily.) Thus, the 
probability that no neighbor of w G V-'XBu' joins the MIS is at most (l-l//32)*V{4/3'') < g-u/(4/36) ^ 

e"*^''^^/^ < Thus, with high probability all good T-L'\B-}i/ vertices are eliminated. Any 



remaining high degree vertices must be in or -B-^', which, by Lemma 6.1 2) number at most 
X\n\/{t/2-X)+4:X\n'\/t^/'^ < 5X\n\/t^/'^. The number of high-degree vertices is reduced by a 
factor since t > maxjA*^, (4(c + l)lnn)''}, so after ©(log^n) rounds all high-degree vertices have 
been eliminated, with probability at least 1 — 

In the case of MM we want to select a matching that matches all T-L' vertices. Each u G S\Bs 
chooses an edge {u,v) G E\B^ uniformly at random and proposes to v that {u,v) be included in 
the matching. Any v G H^By^i receiving a proposal accepts one arbitrarily and becomes matched. 
A good V G 'H'\B-}ii has at least deg^\^^_ {v) > tX/A neighbors u G S\Bs with deg^^^, (u) < /?, so 

the probability that v receives no proposal (and remains unmatched) is less than (1 — 1//?)*^/^ < 
g-ts/'^A/4 ^ o{l/n'^^^). As in the case of MIS, the number of high-degree vertices is reduced by a 
^^^(1) factor in 0(1) time. (For the MM problem our proof could be simplified somewhat since edges 
inside S play no part in the algorithm and need not be classified as good or bad.) □ 

Theorem 6.3 Given a graph of arhoricity X, an MM can he computed in time on the order of: 

min I log A + y^logre, log A + A + log log n | 

for all X, and in O (log A + ^^^) time for X < logi-f^(i) log n. 



Proof: The second and third bounds follow from Theorem 3.3 by substituting for [lU] the deter- 
ministic MM algorithms of Barenboim and Elkin [3] for small arboricity graphs. Their algorithms 
run in 0( ^J°j^g^ ) time on graphs with size s and arboricity A < log^~^*-^^ s and in time 0(A + logs) 
in genera l. In the context of o ur algorithm, s < log^ n. The first MM bound is a consequence of 
Theorem 6.2 and Theorem 3.3 We reduce the maximum degree to A = At = A • max{A^ 2^^} 
in O(log^n) = 0{^/logn) time and find an MM of the resulting graph in 0(log A + log^logn) = 
0(logA + \/logn) time. □ 

Theorem 6.4 Given a graph of arboricity X, an MIS can be computed in time on the order of: 
min I log Ay^logn + log^^^ n, log^ A + A log A + A*^ log log n, log^ A + A"*^^^ log A + log A log log n | 

time for all X and e > 0, and in O (log A (log A + iogfogfo'g„ )) time for X < log^^^'^^^'> logn. 
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Proof: The first bound is a consequence of Theorem |6.2| and Theorem 4.3 since we can reduce the 
maximum degree to A = 2'°^^'^*"poly(A) in 0{log^^^ n) time and then find an MIS of the resulting 
graph in 0(log AyTogn) = 0(log A-y/Iogn + log^'''^ n) time. 

To obtain the second and third bounds we use the fact that a c-coloring can be converted to an 
MIS in 0(c) time. In Phase II of the Halve algorithm from Section[4]the remaining components have 
size at most s = A^logn. We run one of the deterministic /(A)-coloring algorithms of Barenboim 
and Elkin on each component. We use a different palette for each call to Halve, so these coloring 
routines can be pipelined, i.e., Phase I of the next iteration of Halve can start immediately. After 
log A calls to Halve every vertex is either in the MIS under construction, adjacent to an MIS vertex 
(and therefore inactive), or colored one of /(A) log A colors. Thus, an MIS can be found in an 
additional /(A) log A time. For /(A) = 0(A) and /(A) = A^^"*^, respectively, the [5] algorithms run 
in O(A'^logs) time and O(logAlogs) time. 

The fourth bound reflects the use of the Barenboim-Elkin [3] MIS algorithm in Phase II of 
Halve, which runs in 0(log s/ log log s) time if A < log^/^'^^^) s. □ 

Theorem 6.5 Given a graph of arboricity A and any fixed e > 0, a (A + X^'^'')- coloring can be 
computed in 0(log A + log Aloglogn) time and a (A + 0{X)) -coloring can be computed in 0(log A + 
A*^ log log n) time. Consequently, a (A + \)-coloring can be computed in time 

0(log A + min{A^"^'^ + log A log log n, A + A'^ log log n}) 

Proof: Using a palette of A + 1 colors, we reduce the size of uncolored components to s = 
poly(logn) in 0(log A) time, as in Section[5| then run one of the Barenboim-Elkin [5] deterministic 
/(A)-coloring algorithms on each component, which takes O(logAlogs) time for /(A) = A^^"*^ and 
O(A'^logs) time for /(A) = 0(A). In an additional 0(/(A)) time we can convert this coloring to a 
(A + l)-coloring. □ 

7 MIS in Trees and High Girth Graphs 

The MIS algorithm of Metivier et al. [23] works as follows. In each round each remaining vertex 
chooses a random numbeij^ and includes itself in the MIS if it holds a local maximum, thereby 
eliminating it and its neighborhood from further consideration. Observe that the probability that 
V joins the MIS in a round is l/(deg(u) + 1), irrespective of the degrees of its neighbors. 

Lemma |7.1| both generalizes a lemma due to Lenzen and Wattenhofer |19j (from trees to 



high-girth graphs) and corrects a small flaw in their claim and proof. Lemma 7.1 is weaker 



than [TD| Lemma 4.8] by a log log n factor, which provides an O ( -v/log n log log n) bound on the 



0(min{Vlognloglogn, log A log log A + , '°f 'T " „ } ) ■ 



running time of [19], rather than the claimed 0{yJ\og n log log n) . Theorem 7.3 improves this to 

log log n 
log log log 1 



Lemma 7.1 Let G = {V,E) be a graph with girth greater than 6 and maximum degree A and let 
Gi be the remaining graph after i ■ c log log n iterations of the Metivier et al. MIS algorithm. Then 
for all V ^ V and integer 5 > 1, \{w G ^i{v) \ degi{w) > A/2*~'^}| < max{A/2*"'"^, c' logn} with 
high probability, where c and d are constants, and Pj and degj are short for Tq^ and degQ. . 



^°The analysis supposes that vertices choose reals from any common distribution. This can be simulated, for our 
purposes, by selecting an integer from [1, . . . , n''] uniformly at random, for a sufficiently large constant b. 
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Proof: The lemma holds trivially when i < 1. Assume, inductively, that the lemma holds for 
i — 1 and 6 > 1. Consider an arbitrary neighbor w of v with degj^_i{w) > A/2*~^. According 
to the hypothesis, \{w' G Ti_i{w) \ degi_^{w') > A/2(*-i)-'5}| < A/2'-'^+^ < A/T-^+'^. Thus, 
w has at least A/2^~^^^ neighbors (beside v) with degree at most Due to the lower 

bound on girth, the neighborhoods of w's neighbors intersect only at w. The probability that 
w is eliminated is at least the probability that some neighbor w' with deg^_i{w') < A/2*~^~'^ 
joins the MIS. Consider the random numbers r(-) selected by Ti-i{w) in one iteration of the MIS 
algorithm, and let N = {w' E Ti-i{w) \ deg,i_i{w') < A/2*~^~'^ and r{w') > r{w)}. The events 
that vertices in join the MIS are independent as they no longer depend on u;. If we condition on 
|A^| > A/2'^-'^+2, which occurs with probability at least 1/2, the probability that w is eliminated is 
at least the probability that some A^-vertex joins the MIS, which is at least 



i (l - (1 - 2'-i-VA)^/2-^^' 
> i(l-e"i/^) > 1/20 

Now consider v. If 5 > 1 then, by the inductive hypothesis, the number of neighbors with 
degree greater than A/2*~'^ = A/2(*~^)~'^''~-^) is less than max{A/2*"'"'^~^, c' logn} and we need only 
reduce this number by a factor of 4 (or less, if A/2*^^ is already less than c'logn.) Since such a 
neighbor is eliminated with constant probability in each iteration of the algorithm (independently, 
due to the girth assumption), the probability that v still has more than max{A/2*^^, c'logn} such 
neighbors after 0(1) iterations is, by a Chernoff bound, exp(— r2(max{2^, c' logn})) < l/poly(n). 

However, if 5 = 1 then the inductive assumption does not apply. We distinguish two cases: 
either \{w G rj_i(?;) | deg^_i{w) > A/2*^'^}| is greater than T = c"\nn ■ A/2*+'^ (for a suitably 
large constant c") or not. In the first case, we argue that with high probability, v is eliminated in 
one iteration of the Metivier et al. algorithm, slightly modified as follows. Each vertex x whose 
degree is in the range [A/2'-\A/2'-^-'^] = [A/2^-\ A/2*-2] independently chooses a random 
number r(x); other x implicitly choose r(x) = — oo. If x holds a local maximum it joins the MIS. 
By the inductive hypothesis v has less than A/2(*~^)'*"'' = A/2* neighbors with degree greater than 
A/2(*-i)-^ = A/2'-2, so at least T - A/2* = (c"lnn - 2)A/2*+i of v's neighbors will be choosing 
random numbers. The probability that none are local maxima (and therefore v survives) is at most 
(1 - 2*-2/A)(^" ''^"-2)^/2'+' = l/poly(n). 

In the second case v has fewer than T neighbors with degree at least A/2*"''. On the other 
hand, we can assume it has more than c' log n such neighbors, otherwise the claim is trivially 
satisfied. Each such neighbor is eliminated with constant probability in each iteration of the 
algorithm, and these events are independent due to the lower bound on girth. By a Chernoff 
bound, after O(loglogn) iterations v will have at most A/2*"'"'' such neighbors with probability 
1 - exp(-Jl(A/2*+'')) < l/poly(n). □ 



Theorem 7.2 An MIS can be computed in 0(log Aloglogn + exp(0(-v/loglogn))) time on graphs 
with girth greater than 6. 



Proof: Lemma 7.1 implies that in 0(log Aloglogn) iterations the graph is reduced to a situation 
where no vertex has c' log n neighbors with degree at least c' log n. Let L be the set of vertices 
with degree at most d log n and H be the remaining vertices. The maximum degree in the graph 



induced by H \s A' = c'logn, so, by Theorem 4.3 we can find an MIS for H in exp(0(-v/log log n)) 



time, then find an MIS for what remains of L, also in exp(0(-^log log n)) time. □ 
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Theorem 7.3 An MIS of an unoriented tree can he computed in time on the order of 

min| i/lognloglogn, log A log log A + logfogfogn ' A + log* n | , 
with high probability. 



Proof: If the graph has maximum degree greater than 2\/'°s"/ log log reduce it to 2"^^°^ 
in \/log n log log n time, using Theorem 



6.2 



In the modified Metivier et al. algorithm from Lemma 7.1 , all the critical events held with proba- 
bility 1 — exp(— r2(c' logn)) = 1 — l/poly(n). We run the Lemma 7.1 algorithm, substituting c'log A 
for c' log n and i ■ c log log A for i ■ c log log n. In order to guarantee that after i ■ c log log A iterations, 
no vertex has more than max{ A/2*^^, c' log A} neighbors with degree greater than A/2*~'^, for any 
5 > 1, we mark as bad any vertex that violates this condition and temporarily remove bad vertices 
from consideration. More precisely, redefine Ti(v) to be the neighborhood of v after i ■ clog log A 
iterations, excluding bad vertices and those covered by the MIS under construction. Since bad 
vertices are removed from consideration, we guarantee that 

\{w G Ti{v) I degiiw) > A/2'~^}\ < max{A/2^+^ c' log A} (1) 

holds with probability 1, for any 6 > 1. Given that this property is satisfied at v after i ■ clog log A 
iterations, the probability that it is also satisfied at v after (i + 1) • clog log A iterations (and is 
therefore not marked bad) is at least 1 — exp(— r2(c' log A)) = 1 — l/poly(A) for sufficiently large 
constants c and c'. (The analysis is identical to that in Lemma |7.1[ ) Thus, the probability that a 
vertex is marked bad after 0(log A log log A) iterations can be made at most 1/(8 A). 

According to Eqn. ([T]), after 0(log Aloglog A) iterations no vertex has A' = c'log A neighbors 



with degree greater than A'. As in the proof of Theorem 7.2, let H and L be the set of (non-bad) 
vertices after O (log A log log A) iterations with degree at least A' and less than A', respectively. 
Note that no vertex has A' neighbors in H. First compute an MIS on the graph induced by H, 
in 0(A' + log* n) = 0(log A + log* n) time, then compute an MIS on what remains of L, also in 
0(A' + log* n) time. At this point all non-bad vertices have been removed. Let B be the set of 
remaining bad vertices. We claim that with high probability B induces a collection of trees with 
size O(logn). With respect to the original tree, there are fewer than n ■ (4A)* distinct subtrees 
with size t. (There are fewer than 4* distinct rooted trees with size t, and at most nA* ways 
to embed such a subtree in the original tree.) The probability that any given subtree of size t 
is contained in B is 1/(8A)*. Thus, by a union bound, the probability that any subtree with 
size at least t is contained in B is at most n(4A)*/(8A)* = l/poly(n) for t = 0(logn). The 
Barenboim-Elkin algorithm computes an MIS of a collection of such trees, deterministically, in 
0(logt/ loglogt) = 0(loglogn/logloglogn) time. □ 
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Appendix 



A Concentration Inequalities 



See Dubhashi and Panconesi [7] for proofs of Theorems A.l - A.3 and related concentration bounds. 
In our applications of these inequalities we often simplify the sum Ylii^'i ~ ^i)^ ^ follows. If 
-ai)<T and maxi(a'^ -ai)<t then ^-(a'^ - aif < {T/t)f = tT. 

Theorem A.l (Chernoff with negative correlation) Let X = Xi + • • • + Xn be the sum of n 

random variables, where the {Xi} are independent or negatively correlated. Then for any t > 0; 

/ 2t^ 

Fv[X > E[X] + t],Pi[X < E[X] -t] < exp -^^-^ 

where ai < Xi < a'^. 

Theorem A. 2 (Janson) For X = Xi + • • • + Xn the sum of n random variables and t > 0: 

/ 2t^ 

Px[X > E[X] + t], Pr[X < E[X] -t] < exp 



where ai < Xi < a[ and x is the fractional chromatic number of the dependency graph Gx = 
({1, . . . {{i,j) I Xi and Xj are not independent}). 

Theorem A.3 (Azuma-Hoeffding) A sequence Yq, . . . ,Yn is a martingale with respect to Xq, . . . , Xn 

ifYi is a function of Xq, . . . ,Xi andE[Yi\XQ, . . . = y^-i. For such a martingale with bounded 

differences ai <Yi — < a[, 



Pi[Yn >Yo + t], Pr[Yn <Yo-t]< exp 



t2 



Corollary A. 4 Let Z = Zi + ■ ■ ■ + Zn be the sum of n random variables and Xq, . . . ,Xn be 

a sequence, where Zi is a function of XQ,...,Xi, Hi = E[Zi \ Xq, . . . , Xi^i], jj, = Yl,ii^i.> ^^'^ 
o,- < Zi < a';. Then 



Pt[Z > /i + t], Pr[Z < /i + < exp 



Proof: Define the martingale Yq, . . . ,Yn w.r.t. Xq, . . . , Xn by Iq = and Yi = + Zj — ^j, then 
apply Theorem A.3 Note Yn — Yq = Z — ^ and the range of Yi — Yi-i still has size a[ — ai. □ 
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